Computational Linguistics

نویسندگان

  • Lisette Appelo
  • Theo Janssen
  • Franciska de Jong
چکیده

This book describes the theoretical underpinnings and results of Rosetta, a machine translation (MT) project that started at Philips Research Laboratory in the early 1980's; the book focuses on research carried out between 1985 and 1992. While the project was a collective enterprise among a large number of people (as the pen name indicates), the principal authors were Lisette Appelo, Theo Janssen, Franciska de Jong, and Jan Lands-bergen. The book provides a coherent distillation of dozens of publications; in particular, the work described therein has served as the basis of at least two doctoral dissertations (Janssen 1986; Appelo 1993), as well as numerous technical reports, conference papers, and journal articles. In addition to providing a novel framework for examining issues in MT, the book covers several topics that are tied together within a compositional, inter-lingual approach based on Montague grammar. Thus, it serves as a valuable resource for researchers in MT and computational linguistics; with some work it could also serve as a course textbook. 1 The main thesis of the book is that MT can be a good carrier for linguistic and computational research; thus, the emphasis is on modeling linguistic knowledge involved in the translation of natural languages (English, Dutch, and Spanish). The two main principles threaded throughout the book are: Principle of Compositionality: Two expressions are each other's translation if they are built up from parts which are each other's translation, by means of translation-equivalent rules. Principle of Isomorphism: Two sentences are considered translations of each other if they have the same semantic derivation trees (hence corresponding syntactic derivation trees). Within this framework there is a close relation between syntax and semantics: the meaning of an expression is a function of the meaning of its parts, which, in turn are deened by syntax. The translation relation corresponds to the \tuning" of grammars across language 1 For example, it might be possible to include some programming problems at the end of each chapter that would allow the student to incrementally build a mini-translation system between two languages using the principles of the Rosetta approach. An additional instructional tool would be the provision of software for use in the development of such a system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Corpora for computational linguistics 1 Running head : CORPORA FOR COMPUTATIONAL LINGUISTICS Corpora for computational linguistics

Since the mid 90s corpora has become very important for computational linguistics. This paper offers a survey of how they are currently used in different fields of the discipline, with particular emphasis on anaphora and coreference resolution, automatic summarisation and term extraction. Their influence on other fields is also briefly discussed. Corpora for computational linguistics 3 Corpora ...

متن کامل

Linguistics from a Computational Perspective─ Review of Computational Linguistics and Beyond

Since the advent of the computer in 1945, computational research has by now become pervasive in just about all newly created as well as traditional fields. Nor has linguistics escaped this tidal surge. In fact, a pre-computer computational perspective had already been attempted when linguists examined their data utilizing scientific methods; computational linguistics was formalized after the em...

متن کامل

Statistical Methods Statistical Methods#computational Linguistics#machine Learning#stochastic Grammars

" Statistical methods " refers here specifically to statistical methods in computational linguistics. This represents a new body of practice in computational linguistics that has become standard over the last decade.

متن کامل

Computational linguistics beyond the processing of English

Processing of the English language is overwhelmingly mainstream in computational linguistics. This text claims that this situation is neither healthy for computational linguistics nor theoretically tenable.

متن کامل

Psychocomputational Linguistics: A Gateway to the Computational Linguistics Curriculum

Computational modeling of human language processes is a small but growing subfield of computational linguistics. This paper describes a course that makes use of recent research in psychocomputational modeling as a framework to introduce a number of mainstream computational linguistics concepts to an audience of linguistics, cognitive science and computer science doctoral students. The emphasis ...

متن کامل

Lexicalization and Generative Power in CCG

Marco Kuhlmann, Alexander Koller and Giorgio Satta, Lexicalization and Generative Power in CCG, 2015, Computational linguistics Association for Computational Linguistics (Print), (41), 2, 215-247. http://dx.doi.org/10.1162/COLI_a_00219 Copyright: Massachusetts Institute of Technology Press (MIT Press): STM Titles / MIT Press http://mitpress.mit.edu/main/home/default.asp?sid=19E29805-C0A0-4642-8...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994